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IMAGE COMPRESSION METHOD CAPABLE OF 
REDUCING TILE BOUNDARY DISTORTION 

[0001] The present application claims priority to the corresponding Japanese Application 

No. 2003-097151, filed on March 31, 2003, the entire contents of which are hereby incorporated 
by reference. 

RACKCWOITND OF THF INVFNTTON 

[0002] The present invention generally relates to compression of images and more 

particularly to a code discarding processing used in an image compression process conducted by 
an image encoding algorithm that divides an image into tiles and decomposes each tile into sub 
bands by frequency conversion and further conducts bit-plane encoding of the coefficients for the 
sub-bands in each encoding unit. 

[0003] As an example of this type of image encoding process, a JPEG 2000 encoding 

algorithm is known. It should be noted that the JPEG-2000 algorithm has become an 
international standard in 2001 (basic specification: ISO/IEC FCD 15,444-1). For more 
information about JPEG 2000, reference should be made to Yasuyuki Nomizu, "JPEG 2000, 
Next Generation Image Encoding Method," February 13, 2001 Triceps Co., Ltd. Further, motion 
picture processing of JPEG 2000 is defined in the expansion of JPEG 2000 (ISO/IEC FCD 
15,444-3). 

[0004] In the motion picture processing of the JPEG 2000 algorithm, each of the time- 

sequentially continuing still pictures is treated as a frame, and each frame is encoded by an 
algorithm defined in the basic specification of JPEG 2000, independently to the frames before or 
after the frame under consideration. Thus, each motion picture frame of JPEG 2000 can be 
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treated similarly to a still picture. 

[0005] In JPEG 2000, an image is divided into non-overlapping rectangular regions 

called tiles, and in each tile, the image is decomposed into plural sub bands as a result of 
frequency conversion (two-dimensional discrete wavelet transformation). Further, the obtained 
wavelet coefficients are entropy encoded for each sub band, after quantization. In this entropy 
encoding, a block based bit plane encoding technology called EBCOT, which includes dividing 
into blocks, coefficient modeling, and binary arithmetic encoding, is used. 
[0006] In JPEG 2000, on the other hand, there exits a problem in that discontinuity of 

images becomes observable in the decompressed image at the tile boundary in the case the 
compression ratio is increased (tile boundary distortion). It should be noted that such tile 
boundary distortion is caused primarily as a result of an increase in the quantization error of the 
wavelet coefficients at the tile edge, while such an increase of the quantization error is caused as 
a result of the procedure of interpolating the picture element values or coefficient values by using 
a process called mirroring at the time of performing the wavelet transformation of each tile. It 
should be noted that there is a shortage of picture elements occurs at the tile edge, and the 
mirroring technique is used in view of such shortage of the picture elements. 
[0007] Based upon this recognition, there is proposed a method of suppressing the tile 

boundary distortion as disclosed in the United States Patent No. 5,710,835 in which the division 
of an image into the tiles is conducted such that the adjacent tiles overlap with each other. 
Further, there is a known procedure of decreasing the quantization rate at the tile boundary as 
compared with the quantization rate for other parts of the image. Reference should be made to 
Japanese Laid Open Patent Application 2002-374532. 

[0008] Another reason the tile boundary distortion is caused is the influence of the code 
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discard processing. 

[0009] In JPEG 2000, it should be noted that the coefficients of each sub band are 

encoded by every encoding unit called code a block, which is used as the unit of encoding 
processing, wherein the encoding of the coefficients is conducted for each bit plane starting from 
the upper most bit toward the lower most bit. In this encoding process, there is conducted a 
process for selectively discarding the codes of each bit plane in each code block, until a desired 
compression ratio (code amount) is achieved. This process of discarding the code is equivalent to 
the quantization of the coefficients, and thus, this process is called post quantization. 
[0010] In such a procedure of discarding the codes, the importance of the code is 

evaluated for the bit planes in each of the code blocks, and the discarding of the code is 
conducted consecutively, starting from the code having the least significant importance. With 
regard to the evaluation method of the importance of the code, various methods are proposed, 
including the one described in D. S. Taubman and M. W. Marcellin, "JPEG2000: Image 
Compression, Fundamentals, Standards and Practice," Kluwer, 2002. Further, with regard to the 
implementation method for discarding the code starting from the one having the least significant 
importance, it is typical to use a process conducted by the Lagrange's method of undetermined 
multipliers, as is described in D. S. Taubman and M. W. Marcellin, op. cit. 
[0011] However, even in the case where the process disclosed in the United States Patent 

5,710,835 or Japanese Laid Open Patent Application 2002-374532 is employed, there still arises 
cases in which the tile boundary distortion cannot be suppressed sufficiently due to the effect of 
the foregoing code discarding process. 
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SUMMARY OF THF TNVFNTTON 

[0012] An image compression method and apparatus capable of reducing tile boundary 

distortion is described. In one embodiment, the image compressing apparatus divides an image 
into plural tiles, decomposing each of the tiles into plural sub bands by frequency conversion, 
and applies bit plane encoding to each of the sub bands for each encoding unit. The image 
compresses apparatus comprises a code discarding unit to selectively discard a code obtained by 
the bit plane encoding for each encoding unit, where the code discarding unit includes a discard 
amount setting unit that makes generally even the amount of code discarding in those encoding 
units that are in mutually adjacent relationship across a tile boundary. 
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RKIFF nFSrKTPTTON OF THF OK AWTNftS 

[0013] Figure 1 is a flowchart for illustrating an embodiment of the present invention; 

[0014] Figure 2 is a schematic block diagram of the system to which the program of one 

embodiment of the present invention is implemented; 

[0015] Figure 3 is a flowchart for illustrating an example of the processing of code 

discarding/packet generation in Figure 1 ; 

[0016] Figure 4 is a flowchart illustrating another example of the processing of code 

discarding/packet generation in Figure 1 ; 

[0017] Figure 5 is a diagram showing an example of division of an image into tiles; 

[0018] Figure 6 is a diagram showing code blocks in a mutually adjacent relationship 

across a vertical tile boundary; 

[0019] Figure 7 is a diagram showing code blocks in a mutually adjacent relationship 

across a horizontal tile boundary; 

[0020] Figure 8 is a diagram showing code blocks in a mutually adjacent relationship 

across four tile boundaries; 

[0021] Figure 9 is a diagram showing an interleaved arrangement of coefficients across a 

vertical tile boundary; 

[0022] Figure 10 is a diagram showing an interleaved arrangement of coefficients across 

a horizontal tile boundary; 

[0023] Figure 1 1 is a diagram showing an interleaved arrangement of coefficients across 

four tile boundaries; 

[0024] Figure 12 is a diagram showing an example of code block division in tile 0; 

[0025] Figure 13 is a diagram showing an example of the code block division in tile 1 ; 
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[0026] Figure 14 is a diagram showing an example of the code block division in tile 2; 

[0027] Figure 15 is a diagram showing an example of code block division in tile 3 

[0028] Figure 16 is a diagram showing a square root of a sub band gain of a 5x3 

transformation; 

[0029] Figure 17 is a diagram showing square root of a gain of a reverse RCT 

transformation; 

[0030] Figure 18 is a diagram showing a square root of a sub band gain of brightness Y 

in consideration of the gain of the reverse RCT transformation; 
[0031] Figure 19 is a square root of the sub band gain of color difference Cb in 

consideration of the gain of the reverse RCT transformation; 

[0032] Figure 20 is a diagram showing square root of the sub band gain of color 

difference Cr in consideration of the gain of the reverse RCT transformation; 

[0033] Figure 21 is a diagram showing an example of a tile image; 

[0034] Figure 22 is a diagram showing a coefficient arrangement obtained by a wavelet 

transformation in the vertical direction of the tile image; 

[0035] Figure 23 is a diagram showing a coefficient arrangement obtained by a wavelet 

transformation in a horizontal direction with respect to the coefficient arrangement of Figure 22; 
[0036] Figure 24 is a diagram showing a deinterleaved arrangement of the coefficient 

arrangement of Figure 23; 

[0037] Figure 25 is a diagram showing a coefficient arrangement in the state after 

deinterleaving obtained by conducting a two-dimension wavelet transformation twice; 
[0038] Figure 26 is a diagram showing a sub band decomposition of decomposition level 

number three; 
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[0039] Figure 27 is a schematic diagram showing the relationship between the tile, sub 

band, precinct, and code block; and 

[0040] Figure 28 is a diagram showing a format of the encoding data of JPEG 2000. 
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DETAILED DESCRIPTION hf thf EREEERRED fm^odimfnts 

[0041] Accordingly, one or more embodiments of the present invention include a novel 

and useful image compression method and apparatus wherein the foregoing problems are 

eliminated. 

[0042] Another and more specific embodiments of the present invention include an 

image compression method and apparatus capable of suppressing the tile boundary distortion 
caused by the code discard processing effectively. 

[0043] An embodiment of the present invention includes an image compression 

apparatus that compresses images by an image-encoding algorithm, where the image-encoding 
algorithm divides an image into plural tiles, decomposes each tile into plural sub bands by 
frequency conversion, and conducts bit-plane encoding for each of the sub-bands. The sub-band 
forms an encoding unit. The image compression apparatus comprises a code discarding unit for 
discarding a code obtained by the bit plane encoding selectively in each encoding unit. The 
code discarding unit includes trimming units for trimming a code discard amount such that the 
code discard amount becomes generally even in the encoding units that are adjacent to each other 
across a tile boundary. 

[0044] Another embodiment of the present invention includes an image compression 

apparatus as noted above, wherein the trimming unit makes generally even the code discard 
amounts for the encoding units that are adjacent to each other across the tile boundary, the 
encoding units forming the same sub band in the respective tiles. 

[0045] Another embodiment of the present invention includes an image compression 

apparatus as noted above, wherein the trimming unit makes generally even the code discard 
amounts between the encoding units adjacent to each other across the tile boundary, the encoding 
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units forming a sub band of a specific decomposition level or lower. 

[0046] Another embodiment of the present invention includes an image compression 

apparatus as noted above, wherein the code discarding unit includes ranking units for providing 
an order of discarding the codes to each bit plane code of each encoding unit according to the 
importance of the bit plane code. The trimming unit changes the order provided to each bit 
plane code by the ranking unit in the encoding units that are adjacent to each other across a tile 
boundary. 

[0047] Another embodiment of the present invention includes an image compression 

apparatus as noted above, wherein the trimming unit comprises calculation units for calculating 
the importance of bit plane codes for encoding units adjacent to each other across the tile 
boundary by treating the encoding units as a single encoding unit. The code discarding unit 
comprises: an importance calculation unit for calculating importance of bit plane codes in each of 
the encoding units; and a ranking unit providing order of discarding codes to each of the bit 
plane codes of the encoding units adjacent to each other across the tile boundary according to the 
importance calculated by the trimming unit. The ranking unit further provides an order of 
discarding codes to each of the bit plane codes of encoding units that are not adjacent to each 
other across the tile boundary, according to the importance calculated by the importance 
calculation unit. 

[0048] Another embodiment of the present invention includes an image compression 

apparatus as noted above, wherein the algorithm of image encoding is an algorithm based on 
JPEG 2000 and wherein a code block is used for the encoding unit. 

[0049] Another embodiment of the present invention includes an image compression 

apparatus as noted above, wherein the algorithm of image encoding is an algorithm based on 
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JPEG 2000 and a code block is used for the encoding unit, and further wherein the encoding unit 
subjected to trimming is an encoding unit of HL sub band and HH sub band. 
[0050] Another embodiment of the present invention includes an image compression 

method for compressing images according to an image encoding algorithm, where the image 
encoding algorithm divides an image into plural tiles, decomposes each tile into plural sub bands 
by frequency conversion, and conducts bit plane encoding to each sub band for each of encoding 
units. The method comprises selectively discarding a code obtained by the bit plane encoding 
for each encoding unit, and discarding code by, at least, in part, trimming the amount of 
discarded code for coding units that are adjacent to each other across a tile boundary. 
[0051] Another embodiment of the present invention includes an image compression 

method as noted above, wherein trimming makes generally even the amount of discarded codes 
in the encoding units that are adjacent to each other across the tile boundary and forming the 
same sub band in the respective encoding units. 

[0052] Another embodiment of the present invention includes an image compression 

method as noted above, wherein trimming makes generally even the amount of discarded codes 

in the encoding units that are adjacent to each other across the tile boundary and forming a sub 

band of a specific decomposition level or lower in the respective encoding units. 

[0053] Another embodiment of the present invention includes an image compression 

method as noted above, wherein the image encoding algorithm is an algorithm based on JPEG 

2000. 

[0054] According to the image compression apparatus or method of the present invention, 

it becomes possible to suppress the tile boundary distortion caused by the effect of code 
discarding at the time of the image compression effectively. Thus, it becomes possible to 
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reproduce high quality images with reduced tile boundary distortion by decompressing the image 

compressed by the image compression apparatus or method of the present invention. Further, 

various effects explained with reference to the embodiments is obtained. 

[0055] Other embodiments and further features of the present invention will become 

apparent from the following detailed description when read in conjunction with the attached 

drawings. 

[0056] Because the present invention is applicable to the case that uses JPEG 2000 as the 

image encoding algorithm, the present invention will be explained for the embodiments that use 
the JPEG 2000 algorithm. It should be noted, however, that the present invention is applicable 
also to the cases that use image encoding methods other than JPEG 2000 as long as it is the 
image encoding method that divides an image into tiles and decomposes each tile to sub band by 
frequency conversion and conduct bit plane encoding of the coefficients of each sub band for 
each of the encoding units and further capable of conducting discarding of the codes according to 
the needs. 

[0057] In order facilitate understanding of the invention, explanation will be made first 

about outline of the algorithm of JPEG 2000. 

[0058] First, compression process will be explained. 

[0059] In the compression process, an image of each color component is divided into 

non-overlapping tiles of a rectangular form, and each of the tiles of each color component thus 
divided is subjected to compression. 

[0060] Here, each tile image of each color component is subjected to a color 

transformation process for the purpose of improving the of compression ratio. In the event the 
image is given in the form of RGB data, for example, an RCT transformation below is applied. 
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Brightness Y = floor ((R+2G+B)/4) 

Color difference Cr = R-G 

Color difference Cb = B-G ( 1 ) 

Further, a reverse RCT transformation is defined as follows. 
R = G + Cr 

G = Y - floor ((Cr + Cb)/4) 
B = Cb + G (2) 

[0061] Here, it should be noted that floor (x) in the above equation is a floor function of 

x (a function substituting a real number x to an integer not exceeding x and closest to x). 
[0062] It should be noted that this color transformation process could be omitted. In that 

case, each RGB component is processed as it is. Further, in the case of a monochrome image, 
such a color transformation is unnecessary. 

[0063] Next, the tile images of each color component are subjected to a two-dimensional 

wavelet transformation (discrete wavelet transformation), and as a result, each tile image is 
decomposed into plural sub bands. Further, the wavelet coefficients are quantized for each sub 
band. In JPEG 2000, both reversible compression (loss-less compression) and irreversible 
compression (lossy compression) is possible. In one embodiment, in the case of the reversible 
compression, the quantization step number is always one, and there occurs no substantial 
quantization in this stage. 

[0064] Here, it should be noted that the wavelet coefficients after the quantization are 

subjected to bit-plane encoding in each sub band, wherein the bit-plane encoding is conducted for 
each encoding unit called a code block starting from the upper bit down to the lower bit. 
[0065] With regards to the codes thus obtained, the code discard processing mentioned 

before is conducted, and there is formed a packet by gathering the remaining codes that were not 
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discarded. Further, the packets are arranged in a desired order and code data of predetermined 

format is formed by adding thereto necessary tags and tag information. 

[0066] Figure 28 shows the format of the generated JPEG 2000 code data. 

[0067] As can be seen in Figure 28, the code data starts with a tag called the SOC marker 

that indicates the beginning of the code data, wherein tag information called the main header that 

describes various encoding parameters and quantization parameters follows the SOC marker. 

Thereafter, the code data for each tile follows. It should be noted that the code data of each tile 

starts with a tag called SOT marker and includes tag information called tile header (Tile Header), 

a tag called SOD marker, and tile data (Tile Data) that represents the contents of the code of each 

tile. Further, a tag called EOC marker indicating the end is provided after the last tile data. Thus, 

the part after the SOD for each tile in Figure 28 forms a set of packets. 

[0068] The decompression process is conducted in a reverse manner to the compression 

processing. 

[0069] Thus, the code data is decomposed into a code series for each tile and for each 

component based on the tag information. By entropy decoding this code series, the wavelet 
coefficients are restored. Because these wavelet coefficients are quantized, they are subjected to a 
reverse quantization process, and after applying a reverse two-dimensional wavelet 
transformation, the image of each tile is reproduced for each color component. The tile images of 
each color component are returned to the RGB data by a reverse RCT transformation of the 
above-mentioned Equation (2). 

[0070] Here, explanation will be made further with regard to the two-dimensional 

wavelet transformation. 

[0071] Figures 21-25 are diagrams for illustrating the process of conducting a wavelet 
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transformation called 5x3 transformation adopted by JPEG 2000 to a monochromatic tile image 
of 16 x 16 picture elements (or a one color component image of a color image) in a vertical 
direction and a horizontal direction. 



[0072] Figure 21 is a tile image before the transformation. 

[0073] In Figure 21, X and Y coordinates are defined as represented, and the pixel value 

for a picture element having a Y coordinate value y is represented as P (y) (0=y= 15). 
[0074] In JPEG 2000, a high-pass filtering is applied first in the vertical direction 



(direction of Y-axis) with regard to the central picture elements having odd Y coordinates 
(y=2i+l), and the coefficients c(2i+l) are obtained. Next, a low-pass filtering is applied with 
regard to the central picture elements having even Y coordinate values (y=2i), and coefficients 
c(2i) are obtained. Further, the foregoing procedure is conducted for all the values of x. 
[0075] Here, the high-pass filtering and the low-pass filtering are represented 

respectively by the Equations (3) and (4) as follows. 



C(2i+l)=P(2i+l)-floor((P(2i)+P(2i+2))/2) • " • • ( 3 ) 
C(2i)=P(2i)+floor((C(2i- 1 )+C(2i+ 1 )+2)/4) " ' ( 4 ) 

[0076] At the edge part of the image, there are cases in which there exist no adjacent 

picture elements with regard to the central picture element. In such a case, the deficient picture 
element value is supplemented by a process called " mirroring." It should be noted that mirroring 
is an operation of folding the pixel values in line-symmetry with regard to the boundary and uses 
the folded pixel values as the pixel values of the adjacent picture elements. 
[0077] By representing the coefficients obtained by the high-pass filtering as H and the 

coefficients obtained by the low pass filtering as L, the image of Figure 21 is transformed to the 
array of the L coefficients and H coefficients shown in Figure 22 as a result of the transformation 
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in the vertical direction. 

[0078] Next, high-pass filtering is applied to the array of the coefficients of Figure 22 in 

the horizontal direction, wherein the high-pass filtering is applied with regard to the central 
coefficients having odd X coordinate values (y=2i+l), followed by a low-pass filtering applied 
with regard to the central picture elements having even X coordinate values (x=2i). Further, the 
foregoing process is conducted to all the y values. In this case, P(2i) and the like of the foregoing 
equations are regarded as representing the coefficient values. 

[0079] Designating the coefficients obtained by the low pass filtering about a central L 

coefficient as LL, the coefficients obtained by the high-pass filtering about a central L coefficient 
as LL, the coefficients obtained by the low pass filtering about a central H coefficient as LH, and 
the coefficients obtained by the high-pass filtering about a central H coefficient as HH, the array 
of the coefficients of Figure 22 is transformed to the array of coefficients as represented in Figure 
23. Here, it should be noted that the coefficients represented by the same symbol form a group 
called a sub band. In the case of Figure 23, there exist four sub bands. 
[0080] As a result of the foregoing process, one wavelet transformation (one 

decomposition process) is completed. Figure 24 shows an array of the wavelet coefficients in 
which the wavelet coefficients are gathered in each of the sub bands. It should be noted that the 
process of forming such an array of coefficient is called deinterleaving, while the process of 
forming the arrangement as represented in Figure 23 is called interleaving. 
[0081] Further, a second wavelet transformation is performed similarly by regarding the 

LL sub band as an original image. As a result of deinterleaving of such a result, the sub band 
coefficients as represented in Figure 25 are obtained. In Figures 24 and 25, it should be noted that 
the numeral 1 or 2 prefixed to the coefficients represent the number of the wavelet 
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transformations (decomposition level) performed until that coefficient values are obtained. 
[0082] Figure 26 shows an example of the sub band decomposition for the case in which 

the decomposition level is three. In Figure 26, it should be noted that the numerals bracketed in 
each sub band represents a resolution level. 

[0083] In the case of a reverse wavelet transformation, on the other hand, a reverse low 

pass filtering is applied to the interleaved coefficient array shown in Figure 23 first in the 
horizontal direction such that the reverse low pass filtering is applied with regard to the central 
coefficients having an even x value (x=2i). Next, a reverse high-pass filtering is applied with 
regard to the coefficients having odd x values (x=2i+l). Further, the foregoing processing is 
conducted for all the y values. 

[0084] Here, it should be noted that the reverse low pass filtering and the reverse high 

pass filtering are conducted respectively according to Equations (5) and (6) as follows. 

P(2i)=C(2i)-floor ((C(2i-l)+C(2i+l)+2)/4) •••• (5) 
P(2i+l)=C(2i+l)+floor ((P(2i)+P(2i+2))/2) •••• (6) 

[0085] Similarly as before, there are cases in which a central coefficient lacks an adjacent 

coefficient at an edge part of an image. In such a case, the coefficient values are supplemented by 
the mirroring processing. 

[0086] With this, the coefficient array of Figure 23 is transformed back to the coefficient 

array of Figure 22. 

[0087] Next, a reverse low pass filtering is applied in the vertical direction about the 

central coefficients having even y values (y=2i), followed by a reverse high-pass filtering about 
the central coefficients having odd y values (y=2i + 1). By conducting the same to all the x values, 
one reverse wavelet transformation is completed and the coefficient array of the original image of 
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Figure 21 is restored. In other words, the original image is reconstructed. 
[0088] In the case where the wavelet transformation is performed plural times, too, it is 

possible to regard the arrangement of Figure 21 as a LL sub band and repeat a similar reverse 
transformation by using other coefficients such as HL. 

[0089] Figure 27 shows schematically the relationship between the tile, precinct and 

code block for the case the decomposition number is three. 

[0090] It should be noted that precinct means a rectangular division of a sub band and 

roughly indicates the location in the image. The size of the precinct can be specified by a user 
and may be set equal to the size of the sub band. The precincts of the three sub bands, HL, LH 
and HH, are treated as a group. 

[0091] Further, it should be noted that a packet is formed by gathering a part of the codes 

of all the code blocks included in the precinct (such as the codes of three bit planes starting from 
the uppermost bit down to the third bit). A packet free form a code is also allowed. 
[0092] Heretofore, it was explained that each sub band coefficient is encoded in each bit 

plane unit in each code block. In JPEG 2000, the bit plane is actually classified into three sub bit 
planes, and the encoding is conducted for each of the sub bit planes. 

[0093] Thus, the minimum unit of the code discarding process is the sub bit plane. On 

the other hand, in the case it is desired to simplify the code discarding process, it is also possible 
to conduct the code discarding process in each bit plane by using the bit plane as the minimum 
unit of code discarding process. Thus, in the description hereinafter, explanation will be made for 
the case of discarding the codes while using the bit plane as the minimum unit of code discarding 
process. 

[0094] Figure 1 is a flowchart for illustrating an embodiment of the image processing 
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method according to one embodiment of the present invention. 

[0095] The image compression process of the present embodiment uses a JPEG 2000 

algorithm as the image encoding algorithm , and thus, the image compression process includes, 
as represented in Figure 1, a step 100 for performing color transformation of the images, a step 
102 for applying a two-dimensional wavelet transformation to each tile image of each component, 
a step 104 for quantizing the wavelet coefficients in each of the sub bands, a step 106 of bit plane 
encoding each sub band coefficient to the code block (encoding unit) after quantization, a step 
108 for the code discarding and packet generation, and a step 1 10 for forming encoded data of a 
predetermined format by using the packet. 

[0096] Here, the color transformation step 100 is conducted according to the needs as 

mentioned before. Further, the division of the image into tiles conducted before or after the color 
transformation step 100 is omitted in Figure 1 . It should be noted that the primary feature of the 
image compression process of the present invention is the code discarding processing in the step 
108, wherein this code discarding step will be explained later in detail. 
[0097] Figure 1 also shows an explanation of the embodiment of image compression 

apparatus of the present invention. Thus, the image compression apparatus of the present 
embodiment includes the processing unit corresponding to each of the steps shown in Figure 1, 
wherein it should be noted that there is provided the unit for conducting the step 108 for 
discarding the codes. 

[0098] It should be noted in one embodiment that the unit corresponding to the 

processing step of Figure 1 is materialized by a computer program. Thus, the program code, the 
processor-readable medium on which the program code is recorded, such as various recording 
(storage) media including magnetic disks, optical disks, magneto-optical disks, semiconductor 
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memory elements, and also a computer especially configured by the computer-readable program 
code, are also included in the scope of the invention. 

[0099] Figure 2 is a schematic block diagram for illustrating the mode of one 

embodiment of the present invention that uses a computer. 

[00100] Referring to Figure 2, the image compression apparatus includes a CPU 200, a 

memory 201 used by the CPU 200 as a work memory area, a hard disk drive 202 used for storing 
data or program, and a system bus 203. Thus, the program for realizing the processing step or 
image compression processing is loaded to the memory 201 from the hard disk apparatus 202, 
and the CPU 200 carries out the processing. 

[00101] The following summarizes the compression processing. 

(D The image data 205 to be processed and stored in the hard disk device 202 is 
read to the region 206 of the memory 201 in response to a command from the CPU 200. 

® The CPU 200 reads the image data from the memory and applies the 
compression processing shown in Figure 1 . 

® The CPU200 writes the encoded data obtained as a result of the processing 
into a different region 207 of the memory 201. 

[00102] Further, by a command from the CPU 200, this encoding data is stored in the hard 

disk device 202. 

[00103] Figure 3 is a flowchart illustrating an example of the processing of the step 108 or 

the construction corresponding to the foregoing processing. Another example of processing or 
construction corresponding thereto will be described later with reference to Figure 4. 
[00104] In the step 300 of Figure 3, the importance of the codes of each bit plane is 

calculated for each code block according to the equation below. 
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importance = (increment of quantization error caused by discarding of codes) x /" Gs/(code 



amount) 



(7) 



[00105] 



For the "code amount" of Equation (7), the code amount obtained and saved at 



the time of the bit plane encoding (step 106) is used. 



[00106] 



With regard to the "increment of quantization error caused by discarding of the 



codes," this can be obtained by various methods, wherein D. S. Taubman and M. W. Marcellin, 
op cit. provides a mathematically rigorous process. In the present specification, the increment of 
quantization error caused by the discarding of the code will be obtained for the code of the nth bit 
plane as measured from the lowermost bit according to the relationship 

(increment of quantization error caused by the discarding of the code)= (2 A (n- 1 )-2 A (n-2)) x 
(number of the coefficients in the code block). (8) 

[00107] It should be noted that V" Gs of Equation (7) is a square root of the sub band gain 

Gs of the sub band in which the code block is included and a predetermined value predetermined 
for each sub band is used. The wavelet coefficients decoded from the codes are converted to the 
pixel values after conducting the reverse quantization and reverse wavelet transformation, 
wherein the quantization error caused by discarding the codes is reflected to the pixel values with 
a factor different from each other according to the sub bands. The square of this factor is called 
sub band gain. Thus, the quantization error A e of the wavelet coefficient is multiplied with the 
square root of the sub band gain as a result of the reverse wavelet transformation and takes the 
value of 

[00108] While detailed explanation is omitted, it should be noted that the square root of 

the sub band gains of the sub bands LL, HL, LH and HH of the decomposition levels 1 and 2 
takes a value as represented in Figure 16 in the case the wavelet transform called 5 x 3 noted 
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before is used. In the case of compressing a monochromatic image without color transformation, 
the values of Figure 16 are used actually for the term of Equation (7). 
[00109] On the other hand, in the case of compressing the image of RGB data after color 

transformation to the YCbCr color system by the RCT transformation of Equation (2) and 
restoring the RGB data at the time of decompression by conducting the reverse RCT 
transformation of Equation (2), it is necessary to use the sub band gain such that the sub band 
gain takes into consideration the gain of the reverse RCT transformation. 
[00110] While detailed explanation is not provided, the square root of the gain of the 

reverse RCT transformation takes the values represented in Table 17. Thus, in the case of 
compressing an RGB color image formed of each of the R, G and B color components after 
conversion to the Y, Cr and Cb components by the RCT transformation, the calculation of the 
importance of the codes for each of the Y, Cr and Cb components is conducted by using a value 
in which the value shown in Figure 16 (square root gain of the reverse wavelet transformation) is 
multiplied with the value shown in Figure 17 for the term -f Gs of Equation (7) as represented in 
Figures 18, 19 and 20. 

[00111] Next, in the step 302, each bit plane code in each code block is provided with a 

ranking or an order of code discarding process for the entire sub bands according to the order of 
importance obtained in the previous step, starting from the code of smallest importance, wherein 
this ranking process is conducted for each of the tiles consecutively or simultaneously for the 
entire tiles and hence the entire image. This ranking may be conducted by sorting of the codes 
according to the importance or may be conducted by the procedure that uses the Lagrange's 
method of undetermined multipliers, as described in D. S. Taubman and M. W. Marcellin, op. cit. 
In the case of the color images, similar ranking of the codes is achieved simultaneously for the 
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entire color component codes. 

[00112] Conventionally, discarding the code is conducted in the order of the provided 

rank , starting from the least significant bit plane, until a desired compression ratio (code 
amount) is reached. 

[00113] In such a code discarding procedure, there is a possibility that a significant 

difference appears in the ranking provided to the codes between the code blocks adjacent to each 
other across the tile boundary, irrespective of whether the ranking is conducted consecutively for 
each tile or simultaneously to the entire tiles, and as a result, there arises a possibility of 
occurrence of large difference in the code discarding amount of the code blocks, in other words, a 
large difference in the number of the bit planes in which the codes are discarded, between the 
code blocks adjacent to each other across the tile boundary. In such a case, there appears a large 
difference in the quantization error between the code blocks adjacent to each other across the tile 
boundary, and this has become the cause of the tile boundary distortion. 
[00114] The present invention suppresses the tile boundary distortion caused by such a 

mechanism by changing the code discard amount for the code blocks adjacent to each other 
across a tile boundary in the step 304 preceding the code discarding step such that the code 
discarding amount becomes equal or generally equal between the code blocks that are adjacent to 
each other across the tile boundary. 

[00115] For example, consider the case in which one of the code blocks adjacent to each 

other has the ranking indicating that the codes of ten bit planes are to be discarded while the 
other code block has the ranking that the codes of only six bit planes are to be discarded. In such 
a case, the present invention modifies the ranking of the former code block such that there are 
seven bit planes that are to be discarded in place of the ten bit planes. 
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[00116] After such a correction of ranking or order, the step 306 is conducted in which the 

codes are discarded according to the modified order, starting from the lowermost bit plane, until 
the desired code discard amount is reached. 

[00117] Further, in the step 308, the remaining codes are gathered together to form a 

packet. 

[00118] Thus, according to the image compression method of the present embodiment, the 
step 108 for discarding the codes includes the steps 300 and 302 for providing the ranking of 
code discarding according to the importance of the codes and also the step 304 for modifying the 
given rank or order for making generally even the code discarding amount between the code 
blocks adjacent to each other across a tile boundary. 

[00119] In other words, according to the image compression apparatus of the present 

embodiment, the unit for discarding codes corresponding to the step 108 includes a unit 
corresponding to the steps 300 and 302 for providing the order of code discarding according to 
the importance of the code and further the unit corresponding to the step 304 for modifying the 
provided order in order to make generally even the code discard amount between the code blocks 
that are adjacent to each other across the tile boundary. 

[00120] With regard to the relationship between the code blocks adjacent to each other 

across a tile boundary, an example shown in Figure 5 is considered in which an image is divided 
into four tiles 0-3. 

[00121] In such a case, there can exist a situation shown in Figure 6 in which two code 

blocks A and B are adjacent to each other laterally across a vertical tile boundary, a situation 
shown in Figure 7 in which two code blocks A and B are adjacent to each other vertically across 
a horizontal tile boundary, and a situation shown in Figure 8 in which four code blocks A, B, C 
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and D are adjacent to each other across a tile boundary. Here, it should be noted that the mutually 
adjacent relationship under consideration is the relationship existing between the code blocks of 
the sub bands of the same decomposition level. In the case of the color images, the mutually 
adjacent relationship between the code blocks is considered for each of the color components. 
[00122] For the code blocks that are adjacent to each other across a time boundary, there 

exist a code block set called "type 1" including the coefficients that are literally adjacent to each 
other across the tile boundary in the state the wavelet coefficients are interleaved, in other words 
the corresponding picture elements are actually adjacent to each other across the tile boundary, 
and also a code block set called "type 2" in which the code blocks of the same sub band are 
adjacent to each other across the tile boundary. It should be noted that the present invention 
encompasses any of the modes in which the code blocks of the type 1 are chosen, the code blocks 
of the type 2 are chosen or the code blocks of both of the types 1 and 2 are chosen, for the code 
block set adjacent to each other across the tile boundary. 

[00123] Hereinafter, the code block set of the types 1 and 2 will be explained in more 

detail with reference to Figures 9 through 15. 

[00124] As will be understood easily from Figure 23, there appears a coefficient array 

shown in Figure 9 at the vertical tile boundary in the interleaved state of the coefficient. Thus, in 
the case of choosing the type 1 set, the code block set having the code block of the HL sub band 
and including therein the coefficients 500 and the code block of the LL sub band including 
therein the coefficient 501, and also the code block set having the code block of the HH sub band 
including therein the coefficients 503 and the code block of the LH sub band including therein 
the coefficients 504, are chosen as the code block set that are adjacent to each other. 
[00125] In the case of the type 2 set, there is a general tendency that the quantization error 
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associated with discarding of code becomes larger in the HL coefficients and the HH coefficients 
as compared with the LL coefficients or the LH coefficients. Thus, it is reasonable to provide 
emphasis on the code blocks of the HL sub band and HH sub band when considering the 
mutually adjacent relationship between the code blocks. 

[00126] Thus, in the case of adopting the type 2 set, the code block set of the code block 

of the HL sub band including the coefficients 500 and the code block of the HL sub band 
including the coefficients 505, and also the code block set of the code block of the HH sub band 
including the coefficients 503 and the code block of the HH sub band including the coefficients 
506 are chosen as the code block set of the code blocks that are adjacent to each other. Thus, in 
this case, the cod blocks adjacent to each other across the tile boundary are the code blocks of the 
same sub band. 

[00127] Similarly, in the state in which the coefficients are interleaved, there appears the 

coefficient array shown in Figure 10 at the tile boundary extending in the horizontal direction. 
[00128] Thus, when the type 1 set is used, the code block set having the code block of the 

HH sub band including the coefficients 510 and the code block of the HL sub band including the 
coefficients 5 1 1 , and the code block set having the code block of the LH sub band including the 
coefficients 513 and the code block of the LL sub band including the coefficients 514, are chosen 
as the code block set in which the code blocks are adjacent to each other. When the type 2 set is 
used, the code block set having the code block of the HH sub band including the coefficients 510 
and the code block of the HH sub band including the coefficients 512, and the code block set 
having the code block of the LH sub band including the coefficients 513 and the code block of 
the LH sub band including the coefficients 515, are chosen as the code block set in which the 
code blocks are adjacent to each other. In this case, it should be noted that the code block set in 
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which the code blocks adjacent to each other across the tile boundary form the same sub band. 
[00129] Similarly, in the interleaved state of the coefficients, there appears a coefficient 

array as indicated in Figure 1 1 at the boundary of the four tiles. Thus, in the case the type 1 set is 
used, the code block set of four code blocks respectively including the coefficients 520, 521, 522 
and 523, is chosen as the code block set of the code blocks that are adjacent toto each other. 
When the type 2 set is used, the code block set of four code blocks respectively including the 
coefficients 522, 523, 524 and 525, are chosen for the code block set of the code blocks that are 
adjacent to each other. 

[00130] Figures 12-15 show the sub bands corresponding to the tiles 0 - 3 of Figure 5. 

[00131] Referring to the drawings, the wavelet transform is conducted down to the 

decomposition level 2, and as a result, each sub band is divided into code blocks for encoding as 
represented in the drawing by broken lines. Here, it should be noted that the precinct size is set 
identical to the sub band size. 

[00132] Below, the mutually adjacent relationship of the code blocks at the decomposition 

level 2 will be explained with reference to these drawings. 

[00133] In the case the coefficients 500 are included in the code block 600 of the tile 0 

(see Figure 12), the coefficients 505 are included in the code block 605 of the tile 1 (see Figure 
13). Further, the coefficients 503 are included in the code block 603 (see Figure 12) and the 
coefficients 506 are included in the code block 605 of the tile 1 (Figure 13). Thus, the set of the 
code blocks 600 and 605 and the set of the code blocks 603 and 600 are chosen as the code block 
set in which the code blocks are in mutually adjacent relationship across a vertical tile boundary 
(type 2). 

[00134] In the case the coefficients 5 10 of Figure 10 are included in the code block 610 of 
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the tile 0 (Figure 12), the coefficients 512 are included in the code block 612 of the tile 2 (Figure 
14). Thus, the set of the code blocks 610 and 612 is chosen as the code block set in which the 
code blocks are adjacent to each other across a horizontal tile boundary (type 2). 
[00135] Further, the coefficients 522 of Figure 1 1 are included in the code block 622 of 

the tile 0 (Figure 12), the coefficients 524 are included in the code block 624 of the tile 1 (Figure 
13), the coefficients 523 are included in the code block 623 of the tile 2 (Figure 14), and the 
coefficients 525 are included in the code block 625 of the tile 3 (Figure 15). Thus, the set of the 
code blocks 622, 623, 624 and 625 is chosen as the code block set in which the code blocks are 
adjacent to each other (type 2). 

[00136] In average, there is a tendency, in view of the fact that the code discarding 

processing is conducted starting from the sub band of low decomposition level, that the 
difference of code discarding amount between the adjacent code blocks across a tile boundary 
appears particularly conspicuous for the codes of low decomposition level. 
[00137] Thus, in the present embodiment, the modification of the order of code discarding 

processing in the step 304 for making generally even the code discarding amount may be 
conducted only for those codes of the sub bands having a decomposition level lower than a 
predetermined level (such as the decomposition level 1) while maintaining sufficient effect for 
suppressing the tile boundary distortion. With such a procedure, the amount of processing can be 
reduced also. Thus, it should be noted that the present invention also includes such a mode of 
implementation. 

[00138] Next, another example of the processing of the step 108 of Figure 1 or another 

example of the construction of the corresponding processing to this step will be explained with 
reference to the flowchart of Figure 4. 
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[00139] Referring to Figure 4, a code block is selected in the step 400 and examination is 

made in the step 404 whether the pertinent code block is in a mutually adjacent relationship with 
another code block across a tile boundary. Explanation has been made already with regard to this 
mutually adjacent relationship. 

[00140] In the case there exists no other code blocks in the mutually adjacent relationship, 

the step 406 is conducted in which the importance of the code is calculated for each bit plane of 
the pertinent code block. This process of obtaining the importance is explained already with 
reference to the step 300 of Figure 3. After this calculation, the process returns to the step 400 for 
selection of the next code block. 

[00141] In the case there exists another code block in mutually adjacent relationship with 

the selected code block across a tile boundary (Yes in step 404), the step 408 is conducted in 
which the two or four code blocks in the mutually adjacent relationship are gathered together so 
that they are treated as a single code block. Further, calculation is made about the importance of 
the code for each bit plane. This procedure of calculating the importance is conducted similarly 
to the step 406 except that the sum of the code amount of each bit plane of the two or four code 
blocks is used for the denominator of Equation (7) and a cum of the values (product of the 
increment of the quantization error and the square root of the sub band gain) obtained for each bit 
plane of the two or four code blocks is used for the numerator of Equation (7). Thereafter, the 
process returns to the step 400 for selection of the next code block. 

[00142] When the calculation of importance of the code is completed for all of the code 

blocks (Yes in step 402), the step 410 is conducted in which the rank or order of discarding is 
provided to the code of each bit plane in each of the code blocks, starting from the code having 
the least significant importance. 
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[00143] Further, in the step 412, the codes are discarded starting from the lower most bit 

plane according to the rank or order of discarding, until a desired code amount is reached. Finally, 
in the step 414, the remaining codes not discarded are gathered to form a packet. 
[00144] In the step 408, it should be noted that those codes of the code blocks that are in 

mutually adjacent relationship across a tile boundary and belonging to the same bit plane are 
regarded as having the same importance, and thus, these code blocks are provided with the same 
rank. Thus, the code discarding amount becomes more or less even between these code blocks. 
With this, the quantization error associated with the code discarding processing becomes also 
even more or less, and the tile boundary distortion is suppressed. 

[00145] Thus, in one embodiment of the image compression method of the present 

embodiment explained herein, the step 108 for discarding the codes includes, in addition to the 
step 406 for calculating the importance of the codes for each bit plane in the code blocks not in 
mutually adjacent relationship across a tile boundary, but also the step 408 for calculating the 
importance of the code for each bit plane with regard to the set of the code blocks that are in the 
mutually adjacent relationship in order to make even the code discarding amount more or less, by 
treating these two or four code blocks as a single code block. Further, there is provided a step 
410 that provides the order of code discarding for the codes of the code blocks not in the 
mutually adjacent relationship with each other according to the importance calculated by the step 
406 and the order of code discarding for the codes of the code blocks that are in the mutually 
adjacent relationship with each other according to the importance calculated in the step 408. In 
other words, the image compression apparatus of this mode of invention includes the unit for 
discarding the code corresponding to the step 108, the unit for calculating the importance 
corresponding to the step 406 and also the unit for calculating the importance corresponding to 
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the step 408 and further the units corresponding to the step 410. 

[00146] As explained with reference to the step 304 of Figure 3, there is a tendency that 

an observable difference appears in the code discarding amount between the code blocks that are 
adjacent to each other across a tile boundary, particularly in the decomposition level is low. Thus, 
it is possible to apply the process of the step 408 only to the code blocks of the sub band having a 
decomposition level lower than a predetermined level, such as the decomposition level 1 in the 
cease the number of the decomposition is 2. The present invention includes also such a mode. 
[00147] Further, the present invention is by no means limited to the embodiments 

described heretofore, but various variations and modifications may be made without departing 
from the scope of the invention. 
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